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JL.  iBtrBdufitign 

(hrtrall,  the  field  of  Inforaetloo  retrieval  Is  already  sore  aware  than 
■any  other  fields  of  the  relevance  of  artificial  Intelligence  (AI)  [1-6]. 
lOnetheless  there  raaaln  exciting  applications  of  artificial  Intelligence  that 
have  been  so  far  overlooked.  In  this  paper  we  will  point  out  soae  of  the  ways 
artificial  Intelligence  sight  Influence  the  field  of  Informtlon  retrieval.  We 
will  then  exaalne  one  application  In  more  detail  to  discover  the  kind  of 
technical  probless  Involved  In  Ita  fruitful  exploitation. 

• 

Before  proceeding.  It  la  Important  to  Interject  a  note  of  caution.  While 
the  proBlse  of  artificial  Intelligence  Is  Indeed  bright,  the  tine  of  complete 
fulfUlaent  of  Its  proalse  Is  a  long  way  off.  Of  course,  soae  of  the  expected 
contributions  are  shorter  term  than  others.  However,  the  acre  difficult  prob¬ 
lems  will  fall  only  after  a  good  deal  of  basic  research  Is  accomplished. 
Artificial  Intelligence  researchers  have.  In  the  past,  been  culpable  of  what 
oan  most  charitably  be  described  as  over-optimism  [7,8].  This  naivete  on  the 
part  of  even  the  most  respected  of  researchers  stemmed  from  the  profound 
subtleties  underlying  Intelligent  behavior.  The  problem  Is  compounded  by  the 
fact  that  soae  of  the  most  difficult  of  Intelligent  behavior  (l.e.  common 
sense)  seems  Intuitively  easy. 

2.  Catesoriea  ^  H  Annllcatlona 

Artificial  Intelligence  applications  to  Information  retrieval  fall  Into 
four  broad  categories:  (1)  human-databasa  Interfaces,  (2)  conceptual  Indexing, 
(3)  autcmatlc  data  entry,  and  (A)  active  memory  techniques.  In  the  remainder 
of  this  section  we  will  describe  each  of  these  areas  briefly. 

Artificial  Intelligence  applications  In  these  categories  are  In  different 
stages  of  realisation.  There  have  already  been  several  information  retrieval 
systems  written  using  some  of  the  A1  applications  to  be  mentioned.  Other  AI 
applications  have  not  yet  found  their  way  Into  the  information  retrieval  com¬ 
munity.  All  of  the  AI  applications,  however,  must  still  be  regarded  as  exper¬ 
imental  and  research  oriented.  It  is  yet  too  early  to  expect  more  than  the 
the  most  meager  practical  benefits  from  AI. 

The  first  category,  artificial  intelligence  Interfaces  between  humans  and 
databases.  Is  Involved  with  making  computers  generally  more  accessible  to 
untrained  people.  Natural  language  processing  work  falls  Into  this  category. 
Imagine,  for  example,  a  database  system  that  could  be  conversed  with,  much  the 
same  way  as  a  user  might  Interact  with  an  extremely  well  Informed  human.  We 
will  examine  this  category  In  more  detail  In  the  next  section. 

The  second  category,  conceptual  Indexing,  attempts  to  organise  a  database 
In  terms  of  the  —anine  of  Its  entries.  In  such  a  database  the  system  must,  in 
some  sense,  »undarMtandv  Its  entries.  Queries,  specified  In  the  sane  meaning 
representation,  are  then  used  to  Index  Into  the  database.  To  the  extent  that 
this  is  successful,  false  positive  database  responses  are  eliminated.  The 
meaning  query  represents  precisely  the  user's  conceptual  question.  It  is  not 
an  approximation  to  the  users  question  as  are  current  query  languages.  Only 
those  database  Items  which  satisfy  the  meaning  query  are  retrieved.  Of 
oourse,  there  Is  the  possibility  that  the  user  himself  oannot  formulate 
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pr«ola«ly  lAat  «a  wanta.  Thla,  however,  ia  a  different  theoretical  problem 
which  ought  to  be  dealt  with  in  the  firat  category,  intelligent  uaer  inter- 
faoea. 

Huoh  oonoeptual  indexing  work  ia  motivated  by  payohologloal  oonaldera- 
tlona.  There  ia  an  intereating  example  (due  to  Don  Norman)  that  typifies 
human  databaae  aearoh  powera: 

1)  Have  you  ever  ahaken  handa  with  Preaident  Lincoln? 

« 

People,  of  oourae,  reapond  ■No.*  The  intereating  aapeot  ia  how  faat  people 
are  able  to  reapond.  Not  only  are  aubjeota  faat,  they  are  very  certain  about 
the  oorreotneaa  of  the  anawer.  Subjeota  never  obange  their  wind  if  given  more 
time  to  think  it  over.  One  might  firat  hypothealze  that  people  are  able  to 
oonatruot  an  easy  proof,  by  oomparlng  their  own  birth  date  with  a  gueaa  at 
when  Lincoln  died,  thua  ahowlng  that  ahaking  Lincoln* a  hand  would  be  Inpoaal- 
ble.  Thia  would  be  a  difficult  enough  prooeaa  to  implement  on  a  computer. 
However,  people  are  performing  in  a  much  more  aubtle  and  complex  way.  Con- 
alder: 

2)  Have  you  ever  ahaken  handa  with  Preaident  Carter? 

A  negative  reaponae  ia  not  provable  by  the  aame  method  alnoe  Carter  is 
still  living.  let,  people  are  also  very  fast  in  answering  this  query  even  if 
they  answer  in  the  negative.  They  are  also  very  sure  of  the  oorreotneaa. 
Clearly,  human  information  is  organized  in  a  very  interesting  way.  If  this 
organization  could  be  captured  in  an  information  retrieval  aystem  that  system 
would  have  interesting  capabilities  far  surpassing  ourrent  systems. 

Two  notable  attempts  to  capture  certain  aspects  of  human  memory  organiza¬ 
tion  are  IPP  [9]  end  CTNUS  [10].  Both  systems  build  on  work  of  Sobank  [11], 
and  both  view  learning  as  an  inseparable  component  of  memory  management.  IFF 
reads  and  remembers  news  stories  about  terrorism.  The  oonoeptual  content  of 
an  input  article  is  stored  away  in  memory.  The  process  of  storage  also  stimu¬ 
lates  the  system  to  form  generalizations  based  on  comparisons  with  what  has 
been  stored  in  memory  previously.  In  some  sense,  the  memory  index  of  a  new 
entry  is  dynamically  constructed  to  be  the  tests  of  aimilarltles  and  differ- 
enoes  to  other  items  in  memory. 

CTNUS  ia  a  memory  model  baaed  primarily  on  the  professional  life  of  Cyrus 
Vance,  ax-Secretary  of  State  under  Carter.  Background  information  about 
Secretary  Vance  was  programmed  in.  Nepresentatlons  of  news  reports  about 
Vanoa'a  aotlvitles  given  to  the  system  are  organised  and  retrievable  on  oon- 
oaptual  grounds.  CTNUS  could  answer  questions  like: 

3)  Bow  many  times  baa  Vanoe  met  with  Gromyko? 

CTNUS*  knowledge  is  organised  around  a  time-line-like  data  structure  called  an 
arm.  Different  eras  are  used  to  organise  events  from  different,  nearly- 
aeparahle  aegmenta  of  Vanoe* a  life.  The  above  ayatem,  alnoe  it  is  oonoerned 
with  Vanoe  qua  Secretary  of  State,  would  aearoh  that  era  first. 

There  are  two  recent  and  very  interesting  books  on  organising  omqputer 
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[13]. 


In  huun-llke  ways. 


Thase  nr*  by  Roger  Sohank  [12]  and  Don  Homan 


The  third  category  la  autoaatlo  data  entry.  Many  databasea,  eapeclally 
doouaent  retrieval  ayateaa,  are  dynaalo.  Moat  often  Itena  auat  be  added  to  an 
existing  database  although  soae  applications  require  deletion  of  old  elenents 
as  well.  Using  conventional  database  organizations  this  prooesst  uhlle  possi¬ 
bly  tlae  oonsunlng,  Is  relatively  straight  forward.  However*  If  we  adopt  a 
oonoeptually  organized  database  of  the  previous  category*  this  Is  not  the 
ease.  For  such  databases*  a  new  Item  mat  be  napped  Into  a  neanlng  represen¬ 
tation  befoi'e  It  oan  be  added  to  the  database.  The  neanlng  representation  oan* 
of  course*  be  coded  by  hunans.  There  are  several  obvious  and  Insumountable 
obstaoles  to  hand  oodlng  all  of  the  new  data.  First*  It  Is  prohibitively  tine 
oonsunlng  as  well  as  being  tedious  and*  therefore*  error-prone.  Second*  given 
current  neanlng  languages,  oonslstency  Is  Inposslble  to  Insure.  With  Incon¬ 
sistent  oodlng*  database  Integrity  Is  ompronlsed  and  the  advantages  of  oon- 
oeptual  organization  are  lost.  The  Ideal  aolutlon  Is  to  have  an  artificial 
Intelligence  systen  read  and  understand  text  to  be  added  to  the  database. 
■Understanding"  here  Is  defined  as  napping  fron  natural  language  text  to  a 
neanlng  representation.  The  neanlng  representation  oan  then  be  used  to  Index 
the  original  text  Iten.  Thus*  autonatlo  data  entry  nalntains  a  dynanlc  oon¬ 
oeptually  organized  database  with  no  hunan  Intervention. 

The  only  systen  I  an  aware  of  that  attenpted  to  oonblne  conceptual  organ¬ 
ization  of  nenory  with  autonatlo  data  entry  Is  CXFR  [14].  While  one  oan 
easily  Inaglne  systens  that  autonatloally  update  a  non-oonoeptually  based 
nenory  (for  exsnple*  using  key  words)*  these  are  of  little  Interest  fron  an 
artificial  Intelligence  point  of  view.  There  have  been  other  artificial 
Intelligence  systens  run  on  unedited  real-world  Inputs.  However*  the  Inputs 
are  screened  by  a  hunan  to  ellnlnate  those  outside  the  progran's  donaln  of 
expertise.  This  significantly  changes  the  spirit  of  the  task. 

CXFR  resulted  fron  oonblnlng  the  FRUMP  systen  [15]  with  the  CIRUS  [10] 
systen.  FRUMP  reads  the  UPl  newswlre  Inputted  directly  to  the  oonputer.  The 
level  of  understanding  aohleved  by  FRUMP  Is  shallow  due  to  the  large  anount  of 
donaln  knowledge  required.  Its  understanding  approxinates  that  of  a  hunan 
skinning  a  newspaper  article  qulokly.  Its  knowledge  Is  confined  to  63  topic 
areas.  It  oan  only  understand  news  articles  oonoemlng  one  of  these  topics. 
The  topics  range  fron  earthquakes  and  floods  to  trade  agreenents  and  wars.  In 
partloular  It  knows  about  dlplonatlo  visits*  negotiations*  etc.  These  are 
precisely  the  situations  In  which  Secretary  Fanoe  participated  nost.  The  out¬ 
put  of  the  FRUMP  systen  Is  an  "understood”  (that  Is*  oonoeptual)  representa¬ 
tion  of  the  Input.  All  representations  oontalnlng  a  referenoe  to  the  concept 
for  fanoe  are  given  to  the  CIRUS  systen  which  then  organised  then  In  Its 
nsnory.  Thus*  when  functioning  properly*  a  news  article  reporting  a  Fanoe 
trip  to  Vest  Oemany  would  be  understood  and  Inserted  Into  the  database.  A 
user  oould  ask  the  systen  "Where  Is  Fanoe  now"  and  "Why  Is  he  there"  and 
reoelve  an  up-to-the-nlnute  reply. 

It  should  be  stressed  that  the  day  to  day  perfomanoe  of  the  systen  is 
fsr  fron  flswless.  Many  teohnloal  problens  were  discovered  and  addressed  In 
oonstruotlng  the  systen  snd  nany  nore  In  analysing  why  problen  Inputs  were 
Inoorreotly  processed.  In  all  fslmess*  however,  CXFR  Is  intended  as  a 


rM«aroh  toolt  not  as  n  produotlon  lovol  ooaputer  nystw.  Vben  it  fails  in 
Intaraating  ways  CTFR  la  doing  exactly  vhat  it  vaa  Intended  to  do. 

The  fourth  oategory  la  nade  up  of  a  broad  range  of  artlfleial  Intelll- 
genoe  reaearoh  that  night  be  teraed  *aotive  nenory"  approaches  to  Infomatlon 
retrlOYal.  By  "aotlye  nenory*  I  naan  a  system  in  lAioh  the  aenory  itself 
plays  an  act Its  role  in  the  update  and  retrieval  functions.  For  exanple,  con¬ 
sider  a  hunan  aenory.  Buaan  aenories  are  known  to  be  reconstructive  [16]. 
That  Ist  aertaln  facta  are  not  stored.  Rather,  they  are  reconstructed  on 
danaad  from  pther  Infomatlon.  Clearly  a  huaan  aenory  with  all  Its  frailties 
and  fallings  Is  Insufficient  for  aany  Inforaation  retrieval  applications. 
Bowever,  the  notion  opens  the  possibility  of.  In  sene  aore  fundaaental  way, 
storing  rules  that  characterise  data  elwents  rather  than  storing  all  of  the 
data  aleaents  Individually.  Carrying  this  a  step  further,  an  active  aenory 
night  compute  other  functions  as  well.  In  particular,  enforcing  database  con¬ 
sistency  or  adding  true  but  unaentloned  Inferences  nl^t  be  done  by  the  data¬ 
base  Itself. 

An  obvious  way  to  proceed  Is  to  Incorporate  a  theoraa  prover  Into  the 
database  systen.  Queries  oould  be  answered  by  first  looking  for  the  answer 
explicitly,  and.  If  that  failed,  trying  to  prove  the  query  based  on  other 
Infomatlon  In  the  database^  The  aost  straight  forward  application  of  this  is 
the  notion  of  deductive  jUlg  retrieval.  1  nuabar  ^  gugh  avataaa  Juxft 
conatrueted  [17,18,19].  Charniak,  Riesbeck  and  McDermott  [20]  and  Nilsson 
[21]  both  give  an  Introductory  overview  of  how  this  can  be  laplenented  using  a 
first-order  predicate  calculus  theorw  prover.  There  are,  of  course,  probleas 
with  a  normal  predicate  calculus  theorem  prover.  For  example,  membership  In 
the  set  of  true  theoreas  Is,  In  general,  saal-deoldable.  That  Is,  in  general, 
ona  oannot  dlaprove  a  falae  theorea.  One  Interesting  solution  to  this  dlleaaa 
Is  proposed  by  Allen,  Frisch,  and  Lltaan  [22].  They  weaken  the  first-order 
Inference  engine  In  just  the  right  way  so  that  their  theorea  prover  la 
guaranteed  to  halt  on  any  Input. 

Another  problem  Is  that  predicate  calculus  theorems  are  aonotonlo.  That 
Is,  once  a  theorem  la  proved.  It  oannot  later  be  dlaproved  by  the  addition  of 
aore  Information.  This  la  tenable  only  If  the  ayatea  has  ooaplete  knowledge 
of  tts  world.  No  Inoonsistenoles  or  gaps  In  knowledge  are  tolerable.  The 
real  world  of  huaans  la  not  like  this.  People  aost  often  are  not  absolutely 
sure  of  Infomatlon.  Rather,  they  believe  acaething  because  they  find  It  con¬ 
venient  to  do  so  and  It  does  not  conflict  In  Important  ways  with  irtiat  they 
have  previously  believed.  That  la,  they  aake  assuaptions  about  the  world.  If 
these  assuaptions  are  later  judged  to  be  unwarranted,  they  oan  be  given  up. 
Notice,  however,  that  giving  up  an  assuaption  oan  require  giving  up  other 
beliefs  that  were  believed  because  the  original  assuaption  was  believed.  The 
ability  to  *take  back”  a  previously  proved  theorea  Is  beyond  first-order 
pradloate  logic.  This  Is  the  basis  for  adopting  a  non-aonotonlo  logic  [23]. 
Ihiderlylng  auoh  of  this  work  is  the  notion  of  data  dependency  f2A.25.261. 
lafomally,  the  teohnlque  Involves  stwlng  reasons  for  ballevlng  events  in 
aeacry  as  well  as  the  events  theaselves.  If  the  ayatea  dlsoovers  that  a 
atorad  event.  In  faot,  did  not  ooour.  It  oan  undo  all  of  the  structure  that 
was  built  under  the  false  assuaption  that  the  event  did  ooour.  An  Interesting 
applloatlon  of  thla  Is  to  maintain  database  oonslstenoy  (see,  for  exaaple,  the 
truth  aalntenanoe  ayatea  of  Doyle  [25]). 


Another  rolatod  field  is  induetiva  inference.  Keseerohere  in  this  field 
ettenpt  to  oonstruot  generellzatlons  that  oharaoterize  a  set  of  Inputs.  The 
prlaary  aotlvation  is  to  study  learning  and  Infereneei  not  Infomation 
retrieval.  However,  the  area  does  have  implications  for  Inforaatlon  retrieval 
and  one  day  nay  well  form  an  lm>ortant  oomponent  of  most  information  retrieval 
aystema.  If  an  exaot  oharaoterlzatlon  oan  be  oonstruoted,  the  Inputs  need  not 
be  stored.  They  oan  be  derived  from  the  rule  whenever  necessary. 

Let  u&  consider  the  well  known  aystwa  of  Vinston  [27].  Suppose  we  want 
to  know,  for  any  object,  whether  or  not  it  is  an  arob.  This  is  a  ludicrous 
assumption,  'to  be  sure,  but  it  makes  the  problem  tractable  from  an  artificial 
intelligence  point  of  view.  An  information  retrieval  solution  might  be  to  set 
up  a  database  indexed  by  the  name  or  description  of  the  objects.  In  each 
entry  is  stored  whether  or  not  that  object  is  an  arch.  There  must  be  on  entry 
for  each  Object  of  concern.  Thus,  we  have  easily  constructed  a  rather  trivial 
effective  procedure  to  test  for  "archness.*  Winston's  approach  can  be  viewed 
as  a  trainable  "active  memory*  database.  It  tries  to  oonstruot  rules  of  arch¬ 
ness  that  oan  be  applied  to  an  object's  description  to  classify  it  as  an  arch 
or  not.  The  rules  are  oonstruoted  through  analysis  of  a  set  of  training  exam¬ 
ples  selected  by  a  human  teacher.  Each  example  consists  of  an  object  descrip¬ 
tion  and  a  classification  as  to  whether  or  not  it  is  an  arch.  If  the  system 
succeeds  in  constructing  such  rules,  the  system  need  not  actually  store 
entries  for  each  object.  Bather,  the  classification  rules  oan  be  applied  to 
object  descriptions  as  they  are  presented  to  the  system.  The  system  has  a 
kind  of  reconstructive  ability.  In  at  least  one  way  this  method  is  superior 
to  the  oonventlonal  database  approach.  It  oan  classify  objects  never  before 
seen  which  oould  not  possibly  be  contained  in  the  database. 

Beoonstruotion  is  possible  only  if  the  Inputs  exhibit  some  aystematlo 
structures.  However,  most  databases  have  such  systematic  properties,  and 
human  memory  seems  to  take  advantage  of  it.  This  example  Illustrates  the 
trivial  domain  of  "archness."  The  techniques  have  been  applied  to  some  real- 
world  domains  as  well.  Lebowltz  [9]  and  Kolodner  [10]  have  been  discussed 
earlier.  Niehalskl  [28]  has  constructed  a  system  that  learns  characteristics 
of  soybean  diseases  and  oan  be  used  to  diagnose  soybean  problems. 

2.  Problems  j|J^  H  Interfaces 

The  oategory  of  AI  interfaoes  has  received  more  attention  from  the 
artificial  Intelligence  oommunlty  than  have  the  other  areas.  The  ^neral 
problem  attacked  is  to  make  computers  more  aooesslble  to  untrained  users.  The 
following  systems  have  all  been  oonoemed,  to  a  greater  or  lesaer  extent,  with 
facilitating  human  interaction  with  computer  aystems  (possibly  databases) 
through  artificial  intelligence  techniques  [2,A,10,17,22,29,31»33»34»35,36]. 
Nuoh  of  this  work  oonoentrates  on  how  a  computer  oan  be  programmed  to  "under¬ 
stand*  natural  language  oommanda  and  queries. 

The  goal,  of  oourae,  is  to  oonstruot  a  database  system  that  oan  flexibly 
and  sffeotlvely  be  aooeaaed  by  users  with  no  training  in  a  database  query 
language,  and  perhapa  with  no  oomputer  training  at  all.  Thus,  a  user 
Interested  in  legal  preoedenoe  might  aak  a  database  of  legal  oasea  simply: 
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4)  Could  you  olte  all  oases  In  the  last  five  years  in  vbicb  a  husband 
sued  and  was  awarded  child  custody. 

The  user  oan  address  queries  to  the  ooaputer  in  moh  the  sane  Banner  he  would 
ask  a  learned  colleague.  Of  eourae«  one  would  then  like  the  oomputer  to 
respond  in  Buch  the  sane  way  that  a  learned  oolleague  night.  In  this  particu¬ 
lar  exanple,  the  system  might  respond 

5)  There  are  over  2000  such  oases.  Do  you  really  want  them  listed? 

To  which  the  person  responds 

6)  I  meant  only  in  California. 

The  system  says 

7)  There  are  over  500  such  oases.  Do  you  really  want  them  listed? 
Flnsllyy  the  person  sharpens  the  query  with 

8)  Vhat  about  when  tne  man  has  no  steady  InoMie? 
and  is  given  three  relevant  oases. 

This  illustrates  very  well  that  an  untrained  user  might  not  be  able  to 
articulate  his  query.  Indeed,  if  the  user  had  to  deal  with  a  oonventional 
database  he  might  have  done  better.  The  process  of  Inputlng  the  query  via 
some  formal  query  language  may  help  orystallse  the  request  in  the  user*s  mind. 
There  are  several  Important  points  to  note  here.  First,  the  system  engages  in 
a  eonversatlon-llke  interaction  with  the  user.  This  eliminates  the  need  for  a 
preolsely  speoifled  query;  the  query  gradually  forms  out  of  three  distinct 
inputs.  Second,  eaoh  input  itself  raises  typical  but  very  difficult  natural 
language  prooessing  questions.  In  the  first  query  it  is  not  specified  who  was 
sued  or  even  for  what  reason.  Clearly,  any  person  would  assume  that  the 
husband's  wife  is  referred  to  as  the  target  of  the  suit  and  that  the  suit  is 
for  oustody  of  a  ohild  which  is  presumably  the  offspring  of  the  husband  and 
wife.  This  information  is  not  literally  specified  in  the  text.  Literally, 
this  request  is  a  siq>le  oonjunetion.  As  such  it  would  match  the  case  of  John 
who  three  years  ago  sued  his  neighbor  for  backing  into  John’s  oar  and  last 
year  was  divoroed  from  his  wife  and  was  awarded  oustody  of  their  children  by 
the  divorce  judge.  Clearly,  John's  oase  should  not  be  retrieved.  There  is 
also  the  problem  of  context.  The  second  and  third  human  Inputs  are  quite 
meaningless  alone.  They  are  understandable  only  in  terms  of  what  has  gone  on 
before. 

These  and  other  questions  have  been  addressed  by  the  above  aystems.  For 
the  most  part,  these  systems  map  natural  language  queries  onto  some  more  or 
less  oonventional  query  language.  That  is,  the  natural  language  prooessing 
•ystem  is  a  kind  of  front  end  to  a  standard  database  system. 

Ve  will  now  examine  some  of  the  stiokler  problems  that  ariae  in  oon^ 
•truoting  natural  language  prooessing  systems.  This  will  help  Illustrate  the 
eurrent  state  of  artifioial  intelligenoe  researoh  and  explain  why  artificial 


la  not  yot  a  najor  oonponant  of  current  practical  Infomation 
retrieval  ayateaa* 

Tbe  flrat  problem  to  oonaider  la  one  of  grannatloallty.  One  would  like  a 
complete  ayntactlo  grammar  of  Engllab.  Tbla  would  aid  lamenaely  In  prooeaalng 
the  Input  querlea*  It  la  a  problem  becauae,  deapite  reaearohera  beat  efforta 
no  oomplete  grammar  of  bigllah  ezlata.  Hontheleaa,  ayntactlo  paralng  la  a 
popular  flrat  atep  in  natural  language  prooeaalng  ayatema.  Seaearobera  have 
produced  ayntactlo  graamara  that  are  quite  aophlatloated  and  complex  (for 
examplot  aee  Hobba  [37] )•  However^  none  oovera  all  Engllah  oonatructlona. 
Thua,  allowing  unconatralned  Engllah  queriea  la  not  poaaible  if  we  Inalat  that 
a  ayntactlo  parae  be  oonatructed  In  order  to  derive  the  aemantlc  ooaponent. 
The  aemantlo  oomponent  la  made  up  of  the  routines  that  deal  with  meaning. 
These  would  actually  produce  a  query  in  the  query  language. 

Given  this  obstacle,  there  are  two  ways  to  proceed.  The  flrat  la  to  be 
content  with  processing  only  a  subset  of  English.  In  fact,  this  la  not  aa 
limiting  as  It  aeema  at  first.  Host  practical  query  ayatema  would  not  make 
use  of  the  full  flexibility  afforded  by  unconatralned  Engllah  anyway.  Tbe 
llmltatlona  are  primarily  theoretical.  Treating  ayntax  aa  a  flrat  step  in 
processing  the  Input  remains  a  popular  technique.  Augmented  transition  net¬ 
work  parsing  [2,36]  follows  this  paradigm.  There  are,  of  course,  non-ATN 
approaches  that  also  adopt  this  "syntax  first"  strategy  [30,35]. 

The  second  method  of  circumventing  the  lack  of  a  oomplete  Engllah  grammar 
la  to  design  a  system  that  la  primarily  driven  by  semantic  considerations. 
Tbla  method  does  not  require  that  a  oomplete  syntactic  parae  be  produced  prior 
to  aemantlo  prooesslog.  Eatber,  syntactic  processing  la  done  only  trtien  a 
aemantlc  analyser  requests  It.  This  approach  has  also  been  popular  In  the  AI 
oomaunlty  [9»15t33]« 
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A  second,  but  related,  problem  is  that  of  ungrammatical  inputs.  People 
often  treat  fragments  or  phrases  as  sentences.  Buaan  listeners  usually  have 
little  trouble  understanding  these.  Dngraamatloal  Inputs  can  be  very  tricky 
for  natural  language  prooesslng  systems,  however.  Again  there  are  two  possi¬ 
ble  solutions.  The  first  Is  to  anticipate  all  possible  classes  of  ungranati- 
oal  Inputs  In  order  to  write  an  expanded  syntactic  grammar  that  covers  these 
oases  as  well  as  proper  English  queries.  This  approach  was  taken  by  Brown  and 
Burton  [29]  In  their  suooessful  semantic  grammar  work.  Tbe  other  approach, 
treating  syntactic  considerations  as  secondary,  results  In  a  systoi  that  Is 
less  sensitive  to  ungremmatloal  Inputs.  Since  there  is  no  explicit  syntactic 
granmar  there  Is  no  absolute  fora  requirement  that  ungrammatical  Inputs 
violate.  Syntax  Is  still  used,  to  tbe  extent  that  It  Is  needed,  to  aid  In 
aemantlo  prooesslng.  If  this  syntactic  knowledge  Is  violated  the  input  cannot 
be  prooesaed.  However,  the  required  syntax  Is  much  less  than  that  needed  to 
os^^letely  parse  the  Input.  Any  Input  violating  these  meager  syntactic  oon- 
stralnts  would  be  meaningless  to  humans,  too.  It  is  unfair  to  expect  the 
systsk's  understanding  to  surpass  a  human's.  Tbe  system  Is  oompletely  Insen¬ 
sitive  to  alterations  In  the  aon<«essentlsl  syntactic  relations.  On  the  nega¬ 
tive  side,  tbaorc  loally,  *  ms  systams  often  make  no  distinction  at  all 
between  grammatle..  so  Mt  and  wildly  Inoorreot  ayntactlo  oonstruots.  This 
would  be  a  dlsadvait  rge  In  an  Information  retrieval  application  If  ungrammatl- 
oallty  was  used  to  signal  that  tbe  user  might  not  know  himself  idiat  be  wants. 


The  next  problen  we  will  consider  is  establishing  pronoun  referents. 
Vhen  a  user  specifies  a  natural  language  query  containing  a  pronoun,  the 
natural  language  oonponent  aust  resolve  the  pronoun  before  a  well  foraed  data* 
base  query  oan  be  constructed.  Pronoun  resolution  can  be  considered  as  a  oob> 
ponent  of  the  larger  problea  of  anaphora  and  referring  expressions.  However, 
in  the  interest  of  brevity  and  accessibility  we  will  consider  only  a  few 
representative  pronoun  probleas. 

In  the.  slaplest  case,  a  pronoun  stands  in  place  of  another  word  or  phrase 
In  the  input.  For  exaaple,  consider  sentence  (9}* 

9)  John  went  to  bed  after  he  dined. 

The  pronoun  "he”  refers  back  to  the  proper  noun  "J^n."  Ve  interpret  this 
sentence  as  aeaning  that  the  aaae  person  who  ate  also  went  to  bed  and  that  his 
naae  is  John.  An  obvious  and  slaple  (but  unfortunately  wrong)  way  to  process 
this  pronoun  is  to  keep  a  list  of  other  nouns  aentloned  thus  far  in  process¬ 
ing.  Vhen  a  pronoun  is  encountered,  we  search  this  list  for  a  noun  that 
aatohes  in  gender  and  nuaber.  In  this  case,  since  *he”  is  singular  and  aale, 
the  noun  "John”  is  chosen  as  the  referent  rather  than  "bed.”  There  are 
several  laaedlate  probleas  with  this  solution.  Consider 

10)  After  he  dined  John  went  to  bed. 

Here,  again,  "he*  refers  to  ■John.”  However,  the  noun  "John”  is  not  encoun¬ 
tered  until  after  the  pronoun  "he.”  Clearly,  we  cannot  always  insist  that 
pronouns  be  resolved  as  they  are  encountered.  There  is  the  auoh  aore  diffi¬ 
cult  problea  of  which  noun  phrase  to  select  when  aore  than  one  agree  in  gender 
and  nuaber.  It  is  also  not  unooaaon  for  the  pronoun's  referent  to  occur  in  a 
different  sentence  froa  the  pronoun.  See  Charnlak  [30]  for  a  interesting  dis¬ 
cussion  of  this.  Even  without  ooaplicating  things  by  introducing  aany  noun 
phrases  or  aultlple  sentences  we  can  see  that  selecting  a  referent  for  a  pro¬ 
noun  oan  be  subtle  and  difficult  problea.  Consider  sentences  (11)  and  (12). 
Sentence  (11)  is  slailar  to  (9)  end  (10).  Again  the  pronoun  ■be*  refers  to 
■John.”  However,  in  sentence  (12)  which  possesses  aany  slallarlties  to  the 
previous  sentences,  ■be”  cannot  refer  to  ■John.” 

11)  After  John  dined  be  went  to  bed. 

12)  He  went  to  bed  after  John  dined. 

This  ^enoaenon,  idiile  subtle,  is  understood.  The  rule  is  that  the  pro¬ 
noun  subject  of  an  Independent  clause  cannot  find  its  referent  as  the  subject 
of  one  of  its  own  following  dependent  clauses.  Mow  we  will  consider  briefly 
soae  further  probleas  with  pronoun  resolution. 

Consider  the  following  interchange  (13  and  15  are  user  inputs,  14  is  a 
database  response): 

13)  Does  every  salesaan  own  his  own  oar? 

14)  Tea 

15)  Bow  aany  of  then  are  aore  than  five  years  old? 

The  problea  is  the  pronotin  ■tUen”  in  sentence  (15).  In  this  context 


■thoi''  ought  to  be  taken  to  refer  to  the  oars  that  the  aalesnen  own.  This 
reference  Is  oonplioated  by  the  fact  that  there  la  no  previous  literal  phrase 
that  the  pronoun  "them”  stands  for.  In  the  previous  examples,  "be”  stood  for 
"John"  which  was  explicitly  mentioned  elsewhere  In  the  Input.  Here,  there  Is 
no  such  corresponding  phrase  that  means  "the  group  of  oars  that  the  salesmen 
own.”  Furthermore,  there  Is  no  easy  way  to  rule  out  the  referent  "every 
salesman"  for  "them"  which  In  fact  does  explicitly  occur  In  the  sentence.  The 
"every  salesman”  can  be  ruled  out  on  semantic  (or  "meaning")  grounds  by  real¬ 
ising  that*  salesmen  are.  In  general,  much  older  than  five  years  and  that  this 
fact  Is  oomaion  knowledge  which  Is  probably  known  by  the  user.  However,  get¬ 
ting  a  computer  system  to  construct  such  an  Informal  proof  or  even  getting  It 
to  ask  the  right  question  at  the  rl^t  time  without  causing  unmanageable 
side-effects  Is  very  difficult.  In  any  case  this  process  only  rules  out 
"every  salesman"  for  the  referent.  It  says  nothing  about  how  to  construct  the 
correct  referent.  Nash-Vebber  and  Reiter  [32]  call  this  the  problem  of  Impli¬ 
cit  sets.  Clearly  a  good  deal  of  semantic  processing  must  be  done  before  all 
of  the  Input  words  can  be  Interpretted. 

Another  example  showing  how  semantics  can  Influence  pronoun  resolution  Is 
the  following: 

16}  Does  every  salesman  who  has  been  with  the  flra  more  than  two  years 
get  at  least  16](  commission? 

17a)  Yes 

16)  List  them 

In  sentence  (18)  "them”  refers  to  "the  salesmen  who  have  been  with  the 
firm  more  than  two  years  and  get  at  least  commission.”  Here  the  referent 
comes  from  a  previous  Input.  The  problem  Is  sore  subtle  that  that,  however. 
Consider  the  Interchange  slightly  altered: 

16)  Does  every  salesman  who  has  been  with  the  firs  more  than  two  years 
get  at  least  18|  commission? 

17b)  Ho 

18)  List  them 

In  this  version,  the  "then”  In  sentence  (18)  most  likely  refers  to  "the  sales¬ 
men  who  have  been  with  the  flra  more  than  two  years  and  do  NOT  get  at  least 
18$  oonmlsslon. ”  This  Is,  In  some  sense,  the  complement  of  what  the  sane  pro¬ 
noun  referred  to  In  the  previous  Interchange.  Obviously,  the  choice  of  pro¬ 
noun  referents  depends  not  only  on  the  previous  Input  which  contains  the 
referent  but  also  on  Intervening  database  responses. 

There  are  many  other  rwj  difficult  problems  besides  ungrammatical  Input 
and  pronoun  resolution.  A  few  of  the  trickier  problems  are  1}  lexical  ambi¬ 
guity,  2)  conjunction  3)  prepositional  phrase  attachment  and  A)  literally 
Incorrect  Inputs. 

Lexical  ambiguity  has  to  do  with  processing  words  that  oan  have  more  than 
one  meaning.  In  English  It  Is  the  rule  rather  than  the  exception.  One  need 
only  consult  a  dictionary  to  be  convinced  of  this.  It  Is  the  rare  entry  that 
haa  only  one  word  aenae.  Most  words  have  at  least  three  entries  and  some  have 
twenty  or  thirty.  Furthermore,  not  all  meanings  are  the  same  part  of  speech. 
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For  exasple,  oonslder  the  word  "bow.*  My  dictionary  gives  17  aeanlngs.  Some 
are  for  a  verb  (both  transitive  and  Intransitive)  and  some  are  for  a  noun. 
Clearly^  the  woi^  senses  that  the  user  Intended  must  be  selected  by  the  system 
if  it  Is  to  correctly  determine  the  meaning  of  an  Input. 

(injunction  Is  the  use  of  connectives  to  Join  sentence  parts.  The  prob¬ 
lem  Is  that  syntactic  entities  distribute  over  connectives  In  many  different 
ways.  Usually,  these  different  ways  yield  different  sentence  meanings.  For 

example: 

19)  Is  It  time  to  re-order  the  high-voltage  diodes  and  transistors? 

This  Input  has  two  slightly  different  syntactic  parses  corresponding  to 
two  slightly  different  Interpretations.  They  differ  in  whether  or  not  the 
user  has  presupposed  that  the  transistors  are  also  high-voltage.  This  nay 
seen  like  a  small  difference  but  In  the  right  circumstances  It  can  have  a 
large  effect  on  how  the  query  Is  answered.  Suppose,  for  example,  that  there 
are  two  types  of  transistors  In  stock:  normal  transistors  and  high-voltage 
transistors.  A  different  Interpretation  of  the  word  "traiuistors”  may  well 
change  the  response  of  the  system. 

The  solution  must  Involve  the  use  of  domaln-speoifio  semantic  and  prag¬ 
matic  knowledge.  For  example,  suppose  that  the  company  uses  only  one  kind  of 
transistor  -  the  low-voltage  kind.  Clearly,  the  word  ■transistors*  in  (16) 
ought  to  be  understood  as  referring  to  that  Item.  Thus,  the  adjective  "high- 
voltage*  must  be  understood  NOT  to  distribute  to  "transistors."  Notice,  how¬ 
ever,  that  this  means  that  the  actual  syntactic  parse  produced  can  be  influ¬ 
enced  by  knowledge  about  the  contents  of  the  company's  stock  Inventory. 

He  will  now  mention  a  similar  problem,  prepositional  phrase  attachment, 
(insider  sentence  (20). 

20)  Is  It  legal,  in  Arkansas,  to  hang  a  man  with  a  moustache? 

This  Is  an  old  Joke.  The  answer  is  "7es,  but  it  works  better  if  you  use 
a  rope. ■  The  prepositional  phrase  "with  a  moustache"  Is  Initially  interpreted 
as  adjectival  (modifying  "man").  However,  syntax  allows  the  prepositional 
phrase  to  be  attached  to  the  verb  "hang"  Instead.  This  Is  the  reading 
required  by  the  answer.  Syntactically,  both  readings  are  acceptable.  Semant¬ 
ically,  however,  one  Is  far  better.  The  Idea  of  straiigllng  a  man  on  the  gal¬ 
lows  with  his  own  moustache  la  ludicrous.  Again,  high  level  knowledge.  In 
this  case  about  stereotypic  executloi».  Is  necessary  before  a  low  level  syn- 
taotio  parse  can  be  decided  upon. 

Finally,  people  seldMi  say  what  they  mean  and  occasionally  say  nearly  the 
opposite.  Consider  again  sentence  (19)>  The  query  asks  whether  it  Is  time  to 
reorder  "diodes  and  translators."  In  fact,  the  iiser  probably  meant  a  disjunc¬ 
tion  rather  than  a  oonjunctlon.  If  there  were  no  more  transistors  but  suffl- 
olant  diodes  remained,  a  literal  Interpretation  of  the  question  requires  that 
the  database  respond  negatively;  it  is  not  yet  time  to  reorder  them  both. 
Clearly,  a  system  should  respond  to  what  la  meant  and  not  what  is  said.  Con¬ 
sider  another  example  (patterned  after  one  of  Sohank  [3A]): 
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21)  Ar«  the  oaiw  of  tuna  raoeivad  Saptambar  12  still  palatabla? 

Litarally  intarpratad,  tha  anavar  would  ba  *do.*  Cana  ara  not  a  suitable 
food  no  aattar  wbat  they  oontaln  or  whan  they  ware  racalvad.  Clearly »  how- 
wr,  tha  literal  aeanlng  is  not  what  was  intended.  Tha  question  of  palata- 
blllty  is  to  ba  answered  oonoamlng  tha  tuna  inside  tha  oans  and  not  tha  cans 
thaasalvas. 

Kxaaplps  like  these  crop  up  frighteningly  often  in  tha  real  world.  One 
possible  response  to  all  this  sight  ba  *well,  if  people  ara  going  to  talk  to 
oosputarizad  databases,  they  at  least  ought  to  say  what  they  sean."  This 
■Issas  tha  point.  Tha  purpose  of  Introduolng  an  artificial  Intelligence 
natural  language  front  end  to  a  database  is  to  permit  the  user  to  operate  in 
his  own  natlTO  mode.  Ve  want  to  eliminate  the  need  for  specialized  training 
by  allowing  users  to  oommunloate  in  English,  a  language  in  which  they  are 
already  fluent.  To  put  oonstralnts  on  what  English  is  acceptable  and  what  is 
not  violates  the  spirit  of  the  task. 

1.  Conclusion 

I  would  hope  that  the  reader  is  left  with  some  notion  of  the  vast  and 
promising  possibilities  of  incorporating  artificial  intelligence  in  informa¬ 
tion  retrieval  systems.  At  least  equally  Important,  however,  is  the  need  to 
develop  a  realistic  appreciation  of  the  current  state  of  the  field.  Reviewing 
the  previous  sections  I  am  inclined  to  believe  I  have  painted  a  slightly  too 
pessimistic  picture.  But  perhaps  this  is  worthwhile  in  order  to  oounteraot 
the  flashy,  soienoe-flotiony  aide  of  the  field  that  has  been  over-exploited  ^ 
some. 

There  is  little  doubt  that  most  information  retrieval  systems  will  ulti¬ 
mately  oontaln  a  significant  AI  opponent.  Indeed,  a  number  of  current  sys¬ 
tems  already  incorporate  artificial  intelligence  techniques.  Some  of  these 
systems  are  even  oommeroial  products  and  so  are  more  production  oriented. 
Bowever,  they  lack  the  full  generality  and  flexibility  that  one  would  like  in 
such  systems.  The  systems  are  oharaoterized  by  rather  narrow  solutions  that 
degenerate  if  extended  too  broadly.  The  major  problem  remains  lack  of  world 
knowledge.  The  representation  and  organization  of  which  makes  up  a  signifi¬ 
cant  portion  of  current  artificial  intelligence  research. 
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